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S (57) Abstract: A DNA construct comprising in the 5' to 3' direction of transcription operably linked a promoter region directing 
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*^ for keto group containing xanthophyll production and esterification in an oilseed plant and a transcriptional termination region is 
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^ cell, e.g. of rape, sunflower, soybean or mustard origin, and a transgenic oilseed plant-produced xanthophyll, such as canthaxanthin 
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DNA construct and its use. 

The present invention relates to a new DNA construct for transformation into oilseed 
plants. The DNA construct comprises nucleotide sequences encoding peptides with enzyme 
5 activities necessary for the high-level production and esterification of keto group-containing 
xanthophylls in oilseed plants. 

Background of the invention 

Carotenoids are produced de novo by plants, fungi, algae and some bacteria. A 
number of biosynthetic steps are needed for the biological production of the carotenoids. 
10 There are two chemically different groups of carotenoids, namely carotenes containing only 
carbon and hydrogen molecules and xanthophylls containing oxygen in the molecule in 
addition to carbon and hydrogen. 

The xanthophylls, and particularly astaxanthin (3,3'-dihydroxy-p-P-carotene-4,4'- 
dione), are often colored pigments and are used as such or as anti-oxidants. 
15 Carotenes are biological precursors for the production of the oxygen-containing 

xanthophylls. There are two types of enzymes responsible for the introduction of hydroxy 
groups and keto groups into the carotenes, namely hydroxylases and ketolases, respectively. 

The keto group-containing xanthophyll astaxanthin, which has keto and hydroxy 
groups, is biosynthetically produced from beta-carotene. 
20 Large-scale production of xanthophylles from natural sources is at present performed 

by AstaCarotene AB, Gustavsberg, Sweden, by cultivation of the alga Haematococcus 
pluvialis for the production of astaxanthin in esterified form. 

It would be desirable to be able to produce keto group-containing xanthophylls 
particularly astaxanthin, in oilseed plants. Oilseed plants have naturally P-carotene 
25 hydroxylases but lack p-carotene C-4-oxygenase enzymes or ketolases. 

Description of the invention 
The present invention provides DNA constructs enabling and promoting 
production of keto group containing xanthophylls, especially astaxanthin, in oilseed plants, 
such as rape, sunflower, soybean and mustard. The DNA construct is transformed into the 
30 oilseed plant cell for expression of a protein or fused protein which has an enzyme activity 
enabling keto group insertion into a carotene or hydroxy carotene for the biosynthetic 
production of a keto group containing xanthophyll, such as cantaxanthin (P,p-carotene-4,4'- 
dione) and/or astaxanthin. Use is thus made of the biosynthetic pathway of the oilseed plant to 
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produce carotenoids. The naturally occurring synthesis of carotenoids involves a number of 
enzymes, namely 1-D-deoxyxylulose 5-phosphate synthase, isopentenyl 
pyrophosphate:dimethylallyl pyrophosphate isomerase, geranylgeranyl pyrophosphate 
synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, lycopene beta- 
5 cyclase, P-carotene hydroxylase, and P-carotene C-4-oxygenase. Genes coding for peptides 
having these enzymatic activities may be inserted into the DNA construct of the invention, 
one or several per construct, to promote high-level production in the transgenic oilseed plant. 
In case only one enzyme coding gene is inserted per plant, two or more plants may be 
sexually interbred to produce plants containing all the desired enzyme activities. 

10 Thus, the present invention is directed to a DNA construct comprising in the 5' to 3' 

direction of transcription operably linked a promoter region directing transcription to the seed 
of an oilseed plant, a nucleotide sequence coding for at least one peptide with enzyme activity 
necessary for keto group containing xanthophyll production and esterification in an oilseed 
plant and a transcriptional termination region. 

15 In a preferred embodiment of the invention the DNA construct additionally 

comprises between the promoter region and the nucleotide sequence coding for at least one 
peptide with enzyme activity a nucleotide sequence coding for a transit peptide directing the 
translated fusion polypeptide to the chloroplast of the oilseed plant. 

The DNA construct is preferably such that the promoter is a napin promoter, the 

20 peptide with enzyme activity necessary for keto group containing xanthophyll production is 
selected from the group consisting of peptides with 1-D-deoxyxylulose 5-phosphate synthase, 
isopentenyl pyrophosphaterdimethylallyl pyrophosphate isomerase, geranylgeranyl 
pyrophosphate synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, 
lycopene beta-cyclase, P-carotene hydroxylase, and P-carotene C-4-oxygenase activity. To 

25 promote esterification of astaxanthin a nucleotide sequence coding for a peptide with acyl 
transferase activity may be included in the group. 

In a preferred embodiment of the DNA construct according to the invention the 
nucleotide sequence coding for a peptide with enzyme activity is a nucleotide sequence 
coding for a N-terminally truncated P-carotene C-4-oxygenase gene from the alga 

30 Haematococcus pluvialis. 

An example of the DNA construct of the invention is presented in the sequence 
listing as SEQ ID NO:l and in Fig.l. 



WO 01/20011 PCT/SE00/01767 



The present invention is also directed to a transgenic oilseed plant cell 
comprising the DNA construct of the invention, and preferably the oilseed plant is selected 
from the group consisting of rape, sunflower, soybean and mustard. 

The invention is additionally directed to transgenic oilseed plant-produced 
5 xanthophyll, e.g. canthaxanthin and astaxanthin. 

A preferred aspect of the invention is directed to transgenic oilseed plant- 
produced astaxanthin esters. 

The present invention will now be illustrated with reference to the DNA 
construct disclosed in the sequence listing and in Fig.l, and the following description of 
10 embodiments. However, the invention is not limited to these exemplifications. 

Short description of the drawings 
Fig.l illustrates the nucleotide sequence of the DNA construct comprising the napin promoter, 
the chloroplast localization signal, the N-terminally truncated p-carotene C-4-oxygenase gene 
and the termination sequence, and the deduced amino acid sequences of the transit peptide 
1 5 and the p-carotene C-4-oxygenase. 

Description of embodiments 
The invention is illustrated by production of astaxanthin in the seed of oilseed 
rape. The astaxanthin produced in the seed of the transgenic plant is extracted as part of the 
extracted oil. By use of conventionally used protocols for Agrobacterium tumefaciens 
20 mediated transformation such as described by (Hoekema et al. 1 983, An et al. 1 986, Fry et al. 
1987, DeBlock et al. 1988, Radke et al.1988, or Moloney et al. 1989) transgenic plants are 
produced having a chimeric DNA construct that is genetically inherited and is able to produce 
astaxanthin. The nucleotide sequence of the chimeric DNA construct consist of four parts of 
different genetic origin namely: (1) a promoter, (2) a localization signal, (3) a p-carotene C-4- 
25 oxygenase coding region and (4) a termination sequence. 

The napin promoter directs transcription to the seed of oilseed rape (Stalberg et 
al 1996). This promoter was coupled to a localization signal similar but not identical to a 
transit peptide (TP) of Rbcsla (Krebbers, 1988) that directs the translated product of a fused 
gene to the chloroplast. The promoter and the TP sequence were ligated to a part of the coding 
30 sequence of a ketolase gene BCK (Kajiwara et al. 1 995). This enzyme oxygenates p-carotene 
to canthaxanthin, (Fraser et al. 1997). The chimeric DNA construct was then coupled to a 
suitable termination sequence, e.g. that of the Agrobacterium tumefaciens nopaline synthase 
gene (the nos 3' end)(Bevan et al. 1983), as illustrated in Fig.l. 
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Cellular storage of Astaxantin 

The storage of large amounts of free astaxanthin in plants will be difficult due to 
toxic effects of the molecule as it intercalates in the plant membranes. An effective 
esterification of astaxanthin to fatty acids enables storage of the esterified molecules in 
triacylglycerol containing oleosomes. Thus, an acyl transferase can be claimed to be of 
fundamental importance for the process, as is proteins that can mediate transport of different 
forms of astaxanthin from the chloroplast to the vesicles. 

Sequences and oligonucleotide s used in the construction of the DNA construct 

1. Napin promoter (GeneBank ACCESSION No. J02798) 

This promoter sequence, a 1 145 base pair fragment including the 5* leader 
sequence has a unique Hindlll site at the 5' end. The 3' end was synthesized with an 
additionally 6 nucleotide BamHI site. 

2. Transit peptide similar to RBCSla (GeneBank ACCESSION No. XI 3611. XI 4565) 

The transit peptide (TP) was amplified by PCR from -28 to the end of the transit 
cleavage aa=54/55 site of the Rbcsla gene. The 5' end was synthesized with a BamHI site 
and similarly the 3' sequence was synthesized with a Xbal site. The two following 
oligonucleotides were used for the PCR amplification. 

BamHI 

5 ' primer: TP 1 5 ' AGAC GGATCC TCAGTC ACACAAAGAGTA 3 ' 

Sad Xbal 

3 ' primer: TP2 5 'GTTC GAGCTC TCTAG A CATGCAGTTAACGC 3 ' 

3. BCK (fi-carotene C-4 oxygenase) (Genebank ACCESSION No. D45881) 

The BCK fragment was amplified by PCR including a 5* Xbal site and was 
ligated to the TP already described. The 5' primer (BCK1) used for PCR, is homologous to 
the BCK sequence from nucleotide 264 and the 3' oligonucleotide (Ax40) ends with a stop 
codon and was synthesized with a Sad restriction site for cloning. The synthesized fragment 
was fused to the TP as shown in Fig 1 . 
Oligonucleotides used for PCR: 

Xbal 

5 ' primer: BCK1 5 'ACAG TCTAGA ATGCCATCCGAGTCGTCA 3 ' 

Sad 

3 primer: AX40 5 'CACCGAGCTCCATGACACTCTTGTGCAGA 3 ' 
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Description of SEQ ID NO:l and SEQ ID NO:2 

The sequences shown i Fig.l are the same as the two sequences which are 
shown in the sequence listing. 

The SEQ ID NO:l is a nucleotide sequence composed of the following features: 
5 Nucleotide No. 

Cloning site Hindm 1 -6 

Napin Promoter 1 - 1 1 45 

Cloning site BamHI 1 146-1 151 

Transit peptide leader 1 1 52-1 1 78 

1 0 Transit peptide coding 1 1 79- 1 347 

Cloning site Xbal 1348-1353 

P-carotene C-4-oxygenase 1 3 54-22 1 7 

P-carotene C-4-oxygense 3' untranslated 2218-2266 

Cloning site Sad 2267-2272 
15 Nopaline synthetase termination 2273-2536 

Cloning site EcoRI 2538-2543 



The SEQ ID NO: 2 is a deduced amino acid sequence of the fusion protein of 
the transit peptide and the peptide with P-carotene C-4-oxygenase activity. 
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Claims 

1. A DNA construct comprising in the 5' to 3' direction of transcription operably 
linked a promoter region directing transcription to the seed of an oilseed plant, a nucleotide 
sequence coding for at least one peptide with enzyme activity necessary for keto group 

S containing xanthophyll production and esterification in an oilseed plant and a transcriptional 
termination region. 

2. The DNA construct according to claim 1, which between the promoter region 
and the nucleotide sequence coding for at least one peptide with enzyme activity additionally 
comprises a nucleotide sequence coding for a transit peptide directing the translated fusion 

10 polypeptide to the chloroplast of the oilseed plant. 

3. The DNA construct according to claim 1 or 2, wherein the promoter is a napin 
promoter, the peptide with enzyme activity necessary for keto group containing xanthophyll 
production and esterification is selected from the group consisting of peptides with, 1-D- 
deoxyxylulose 5-phosphate synthase, isopentenyl pyrophosphaterdimethylallyl pyrophosphate 

15 isomerase, geranylgeranyl pyrophosphate synthase, phytoene synthase, phytoene desaturase, 
zeta-carotene desaturase, lycopene beta-cyclase, p-carotene hydroxylase, P-carotene (lip- 
oxygenase, and acyl transferase activity. 

4. The DNA construct according to any one of claims 1-3, wherein the 
nucleotide sequence coding for a peptide with enzyme activity is a nucleotide sequence 

20 coding for a N-terminally truncated P-carotene C-4-oxygenase gene from the alga 
Haematococcus pluvialis. 

5. The DNA construct according to claim 4, wherein the nucleotide sequence is 
SEQ ID NO:l. 

6. Transgenic oilseed plant cell comprising the DNA construct of any one of 

25 claims 1-5 . 

7. Transgenic oilseed plant cell according to claim 6, wherein the oilseed plant is 
selected from the group consisting of rape, sunflower, soybean and mustard. 

8. Transgenic oilseed plant-produced xanthophyll. 

9. Transgenic oilseed plant-produced xanthophyll according to claim 8, wherein 
30 the xanthophyll is canthaxanthin 

10. Transgenic oilseed plant-produced xanthophyll according to claim 8, 
wherein the xanthophyll is astaxanthin. 

11. Transgenic oilseed plant-produced xanthophyll according to claim 8, 
wherein the xanthophyll is astaxanthin esters. 
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1/3 

Napin promoter 

AAGCTTTCTTCATCGGTGATTGATTCCTTTAAAGACTTATGTTTCTTATCTTGCTTCTGA 

GGCAAGTATTCAGTTACCAGTTACCACTTATATTCTGGACTTTCTGACTGCATCCTCATT 

TTTCCAACATTTTAAATTTCACTATTGGCTGAATGCTTCTTCTTTGAGGAAGAAACAATT 

CAGATGGCAGAAATGTATCAACCAATGCATATATACAAATGTACCTCTTGTTCTCAAAAC 

ATCTATCGGATGGTTCCATTTGCTTTGTCATCCAATTAGTGACTACTTTATATTATTCAC 

TCCTCTTTATTACTATTTTCATGCGAGGTTGCCATGTACATTATATTTGTAAGGATTGAC 

GCTATTGAGCGTTTTTCTTCAATTTTCTTTATTTTAGACATGGGTATGAAATGTGTGTTA 

GAGTTGGGTTGAATGAGATATACGTTCAAGTGAAGTGGCATACCGTTCTCGAGTAAGGAT 

GACCTACCCATTCTTGAGACAAATGTTACATTTTAGTATCAGAGTAAAATGTGTACCTAT 

AACTCAAATTCGATTGACATGTATCCATTCAACATAAAATTAAACCAGCCTGCACCTGCA 

TCCACATTTCAAGTATTTTCAAACCGTTCGGCTCCTATCCACCGGGTGTAACAAGACGGA 

TTC CG AATTTGGAAGATTTTGACT CAAATT CC CAATTT ATATTGAC CGTG ACTAAATCAA 

CTTTAACTTCTATAATTCTGATTAAGCTCCCAATTTATATTCCCAACGGCACTACCTCCA 

AAATTTATAGACTCTCATCCCCTTTTAAACCAACTTAGTAAACGTTTTTTTTTTTAATTT 

TATGAAGTTAAGTTTTTACCTTGTTTTTAAAAAGAATCGTTCATAAGATGCCATGCCAGA 

ACATTAGCTACACGTTACACATAGCATGCAGCCGCGGAGAATTGTTTTTCTTCGCCACTT 

GTCACTCCCTTCAAACACCTAAGAGCTTCTCTCTCACAGCACACACATACAATCACATGC 

GTGCATGCATTATTACACGTGATCGCCATGCAAATCTCCTTTATAGCCTATAAATTAACT 

CATCCGCTTCACTCTTTACTCAAACCAAAACTCATCAATACAAACAAGATTAAAAACATA 

End -2 8 untranslated leader TP start 

CACGAGGATCCTCAGTCACACAAAGAGTAAAGAAGAACAATGGCTTCCTCTATGCTCTCT 

MAS S M L S 

TCCGCTACTATGGTTGCCTCTCCGGCTCAGGCCACTATGGTCGCTCCTTTCAACGGACTT 
SATMVAS PAQATMVAP FNGL 

AAGTCCTCCGCTGCCTTCCCAGCCACCCGCAAGGCTAACAACGACATTACTTCCATCACA 
KS SAAFPATRKANNDI T SIT 

FIG.l 
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TP End C-4 -Oxygenase 

AGCAACGGCGGACGCGTTAACTGCATGTCTAGAATGCCATCCGAGTCGTCAGACGCAGCT 
SNGGRVNCMSRMPSESSDAA 

CGTCCTGCGCTAAAGCACGCCTACAAACCTCCAGCATCTGACGCCAAGGGCATCACGATG 
RPALKHAYKPPASDAKGITM 

GCGCTGACCATCATTGGCACCTGGACCGCAGTGTTTTTACACGCAATATTTCAAATCAGG 
ALTIIGTWTAVFLHAIFQIR 

CTACCGACATCCATGGACCAGCTTCACTGGTTGCCTGTGTCCGAAGCCACAGCCCAGCTT 
LPTSMDQLHWLPVSEATAQL 

TTGGGCGGAAGCAGCAGCCTACTGCACATCGCTGCAGTCTTCATTGTACTTGAGTTCCTG 
LGGSSSLLHXAAVFXVIiEFIj 

TACACTGGTCTATTCATCACCACACATGACGCAATGCATGGCACCATAGCTTTGAGGCAC 
YTGIjFITTHDAMHGT xalrh 

AGGCAGCTCAATGATCTCCTTGGCAACATCTGCATATCAGTGTACGCCTGGTTTGACTAC 
R QLNDLLGNICISLYAWFDY 

AGCATGCTGCATCGCAAGCACTGGGAGCACCACAACCATACTGGCGAAGTGGGGAAAGAC 
SMLHRKHWEHHNHTG EVGKD 

CCTGACTTCCACAAGGGAAATCCCGGCCTTGTCCCCTGGTTCGCCAGCTTCATGTCCAGC 
PDFHKGNPGLVPWFASFMSS 

TACATGTCCCTGTGGCAGTTTGCCCGGCTGGCATGGTGGGCAGTGGTGATGCAAATGCTG 
YMS LWQFARLAWWAVVMQMIj 

GGGGCGCCCATGGCAAATCTCCTAGTCTTCATGGCTGCAGCCCCAATCTTGTCAGCATTC 
GAP MANLLVFMAAAP I L SAF 

CGCCTCTTCTACTTCGGCACTTACCTGCCACACAAGCCTGAGCCAGGCCCTGCAGCAGGC 
RLFYFGTYLPHKPEPGPAAG 

TCTCAGGTGATGGCCTGGTTCAGGGCCAAGACAAGTGAGGCATCTGATGTGATGAGTTTC 
SQVMAWFRAKTSEAS DVMSF 

CTGACATGCTACCACTTTGACCTGCACTGGGAGCACCACAGATGGCCCTTTGCCCCCTGG 
LTCYHFDLHWEHHRWPFAPW 

C-4 oxygenase Stop 
TGGCAGCTGCCCCACTGCCGCCGCCTGTCCGGGCGTGGCCTGGTGCCTGCCTTGGCATGA 
WQLPHCRRLSGRGLVPALA* 



FIG.l (cont. ) 
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C-4 oxygenase untranslated region Nos term 

CCTGGTCCCTCCGCTGGTGACCCAGCGTCTGCACAAGAGTGTCATGGAGCTCGAATTTCC 

CCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCCGGTCTTG 

CGATGATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAATAATTAACATGTAAT 

GCATGACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATTTAAT 

ACGCGATAGAAAACAAAATATAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTCAT 
end 

CTATGTTACTAGATCGGGAATTC 



Fig.l (cont.) 
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<110> AstaCarotene AB 



<120> DNA construct and its use 

<130> 2 92 95 -AstaCarotene 

<140> 
<141> 

<160> 2 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 2543 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: napin promoter 

+ choroplast localization signal + beta-carotene C-4 oxygenase 
coding sequence + termination sequence 



<220> 
<221> 
<222> 


promoter 
(1) . . (1145) 












<220> 
<221> 
<222> 


transit_peptide 
(1179) . . (1347) 












<220> 
<221> 
<222> 


CDS 

(1179) . . (2217) 












<220> 
<221> 
<222> 


terminator 
(2273) . . (2536) 












<400> 1 
aagctttctt 


catcggtgat 


tgattccttt 


aaagacttat 


gtttcttatc 


ttgcttctga 


60 


ggcaagtatt 


cagttaccag 


ttaccactta 


tattctggac 


tttctgactg 


catcctcatt 


120 


tttccaacat 


tttaaatttc 


actattggct 


gaatgcttct 


tctttgagga 


agaaacaatt 


180 


cagatggcag 


aaatgtatca 


accaatgcat 


atatacaaat 


gtacctcttg 


ttctcaaaac 


240 


atctatcgga 


tggttccatt 


tgctttgtca 


tccaattagt 


gactacttta 


tattattcac 


300 


tcctctttat 


tactattttc 


atgcgaggtt 


gccatgtaca 


ttatatttgt 


aaggattgac 


360 


gctattgagc 


gtttttcttc 


aattttcttt 


attttagaca 


tgggtatgaa 


atgtgtgtta 


420 


gagttgggtt 


gaatgagata 


tacgttcaag 


tgaagtggca 


taccgttctc 


gagtaaggat 


480 


gacctaccca 


ttcttgagac 


aaatgttaca 


ttttagtatc 


agagtaaaat 


gtgtacctat 


540 
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aactcaaatt cgattgacat gtatccattc aacataaaat taaaccagcc tgcacctgca 600 
tccacatttc aagtattttc aaaccgttcg gctcctatcc accgggtgta acaagacgga 660 
ttccgaattt ggaagatttt gactcaaatt cccaatttat attgaccgtg actaaatcaa 720 
ctttaacttc tataattctg attaagctcc caatttatat tcccaacggc actacctcca 780 
aaatttatag actctcatcc ccttttaaac caacttagta aacgtttttt tttttaattt 840 
tatgaagtta agtttttacc ttgtttttaa aaagaatcgt tcataagatg ccatgccaga 900 
acattagcta cacgttacac atagcatgca gccgcggaga attgtttttc ttcgccactt 960 
gtcactccct tcaaacacct aagagcttct ctctcacagc acacacatac aatcacatgc 1020 
gtgcatgcat tattacacgt gatcgccatg caaatctcct ttatagccta taaattaact 1080 
catccgcttc actctttact caaaccaaaa ctcatcaata caaacaagat taaaaacata 1140 



cacgaggatc ctcagtcaca caaagagtaa agaagaaca atg get tec tct atg 
3 SM Met Ala Ser Ser Met 

1 5 

etc tct tec get act atg gtt gec tct ccg get cag gee act atg gtc 
Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala Gin Ala Thr Met Val 
10 15 20 

get cct ttc aac gga ctt aag tec tec get gee ttc cca gee acc cgc 
Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala Phe Pro Ala Thr Arg 
25 30 35 

aag get aac aac gac att act tec ate aca age aac ggc gga cgc gtt 
Lys Ala Asn Asn Asp He Thr Ser He Thr Ser Asn Gly Gly Arg Val 
40 45 50 

aac tgc atg tct aga atg cca tec gag teg tea gac gca get cgt cct 
Asn Cvs Met Ser Arg Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro 
55 60 65 

gcg eta aag cac gee tac aaa cct cca gca tct gac gee aag ggc ate 
111 Leu Lys His Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly lie 
70 75 80 85 

acg atg gcg ctg acc ate att ggc acc tgg acc gca gtg ttt tta cac 
Thr Me? Ill Leu Thr He He Gly Thr Trp Thr Ala Val Phe Leu His 
90 95 100 

gca ata ttt caa ate agg eta ccg aca tec atg gac cag ctt cac tgg 
Ala He Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His Trp 
105 HO 115 

ttg cct gtg tec gaa gee aca gee cag ctt ttg ggc gga age age age 
HI Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser 
120 125 130 

eta ctg cac ate get gca gtc ttc att gta ctt gag ttc ctg tac act 
Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr Thr 
135 140 1*5 



1194 



1242 



1290 



1338 



1386 



1434 



1482 



1530 



1578 



1626 
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ggt eta ttc ate ace aca cat gac gca atg cat ggc acc ata get ttg 1674 
Gly Leu Phe lie Thr Thr His Asp Ala Met His Gly Thr lie Ala Leu 
150 155 160 165 

agg cac agg cag etc aat gat etc ctt ggc aac ate tgc ata tea ctg 1722 
Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn He Cys He Ser Leu 
170 175 180 

tac gee tgg ttt gac tac age atg ctg cat cgc aag cac tgg gag cac 1770 
Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg Lys His Trp Glu His 
185 190 195 

cac aac cat act ggc gaa gtg ggg aaa gac cct gac ttc cac aag gga 1818 
His Asn His Thr Gly Glu Val Gly Lys Asp Pro Asp Phe His Lys Gly 
200 205 210 

aat ccc ggc ctt gtc ccc tgg ttc gec age ttc atg tec age tac atg 1866 
Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met 
215 220 225 

tec ctg tgg cag ttt gec egg ctg gca tgg tgg gca gtg gtg atg caa 1914 
Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala Val Val Met Gin 
230 235 240 245 

atg ctg ggg gcg ccc atg gca aat etc eta gtc ttc atg get gca gec 1962 
Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala 
250 255 260 

cca ate ttg tea gca ttc cgc etc ttc tac ttc ggc act tac ctg cca 2010 
Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro 
265 270 275 

cac aag cct gag cca ggc cct gca gca ggc tct cag gtg atg gec tgg 2058 
His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp 
280 285 290 

ttc agg gee aag aca agt gag gca tct gat gtg atg agt ttc ctg aca 2106 
Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr 
295 300 305 

tgc tac cac ttt gac ctg cac tgg gag cac cac aga tgg ccc ttt gee 2154 
Cys Tyr His Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala 
310 315 320 325 

ccc tgg tgg cag ctg ccc cac tgc cgc cgc ctg tec ggg cgt ggc ctg 2202 
Pro Trp Trp Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu 
330 335 340 

gtg cct gec ttg gca tgacctggtc cctccgctgg tgacccagcg tetgeacaag 2257 
Val Pro Ala Leu Ala 
345 

agtgtcatgg agctcgaatt tccccgatcg ttcaaacatt tggcaataaa gtttcttaag 2317 
attgaatcct gttgccggtc ttgegatgat tatcatataa tttctgttga attacgttaa 2377 
gcatgtaata attaacatgt aatgeatgae gttatttatg agatgggttt ttatgattag 2437 
agtcccgcaa ttatacattt aatacgegat agaaaacaaa atatagegeg caaactagga 24 97 
taaattatcg cgcgcggtgt catctatgtt actagategg gaattc 2543 
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<210> 2 
<211> 346 
<212> PRT 

lull SS$£ B B 3T£5fic±^ Sequence: deduced fusion protein of 

transl? peptide + peptide with beta-carotene C-4 oxygenase activity 

<400> 2 

Met Ala ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 
! 5 10 15 

Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp He Thr Ser He Thr Ser 
35 40 45 

Asn Gly Gly Arg Val Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser 
50 * 55 60 

Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser 



65 



70 



Asp Ala Lys Gly He Thr Met Ala Leu Thr He He Gly Thr Trp Thr 



85 



Ala val Phe Leu His Ala He Phe Gin He Arg Leu Pro Thr Ser Met 



100 



105 HO 



Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu 
115 I 20 

Gly Gly Ser Ser Ser Leu Leu His He Ala Ala Val Phe He Val Leu 
130 135 1*° 



Glu Phe Leu Tyr Thr Gly Leu Phe He Thr Thr His Asp Ala Met His 
145 150 155 

Gly Thr lie Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn 
165 I 70 175 



lie Cys He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg 
180 185 I 90 

Lys His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro 
195 200 205 

Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe 
210 215 220 

Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp 
225 230 235 

Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val 



245 



250 
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Phe Met Ala Ala Ala Pro lie Leu Ser Ala Phe Arg Leu Phe Tyr Phe 
260 265 270 

Gly Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser 
275 280 285 

Gin Val Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val 
290 295 300 

Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu His His 
305 310 315 320 

Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His Cys Arg Arg Leu 
325 330 335 

Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 
340 345 
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